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Genome Sequence of Streptomyces viridosporus Strain T7A ATCC 
39115, a Lignin-Degrading Actinomycete 
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We announce the availability of the genome sequence of Streptomyces viridosporus strain T7A ATCC 391 15, a plant biomass- 
degrading actinomycete. This bacterium is of special interest because of its capacity to degrade lignin, an underutilized compo- 
nent of plants in the context of bioenergy. It has a full complement of genes for plant biomass catabolism. 
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In the search for fossil fuel alternatives, much effort has been 
devoted to the discovery and development of microorganisms 
that can efficiently convert plant biomass into biofuels and com- 
modity chemicals (1,2). Many microorganisms have been identi- 
fied that can degrade the carbohydrate components in plants. 
However, few are known that also depolymerize and consume 
lignin, a structural polymer in plant cell walls. One such organism 
is Streptomyces viridosporus strain T7A ATCC 39115, a soil- 
dwelling actinomycete (3-7). Importantly, it is the source of the 
first bacterial peroxidase that depolymerizes lignin (8-12). The 
absence of information on the genes encoding this lignin peroxi- 
dase and other enzymes responsible for lignin degradation and 
their biotechnological potential motivated our efforts to sequence 
the S. viridosporus genome. 

The draft genome sequence of S. viridosporus was generated at 
the Department of Energy (DOE) Joint Genome Institute (JGI) 
using a combination of Illumina (13) and 454 technologies (14). 
The Illumina GAii shotgun library generated 67,837,180 reads to- 
taling 5,155.6 Mb. The 454 Titanium standard library generated 
228,388 reads, and the paired-end 454 library (with an average 
insert size of 8 kb) generated 635,872 reads, totaling 223.4 Mb of 
454 data. The 454 data were assembled together with Newbler, 
version 2.3, while Illumina sequencing data were assembled with 
Velvet, version 1.0.13 (15). The 454 and Illumina assemblies were 
integrated using parallel Phrap, version SPS-4.24 (High Perfor- 
mance Software, LLC). Illumina data were used to correct poten- 
tial base errors and increase consensus quality using the software 
Polisher developed at the JGI (A. Lapidus, unpublished). Possible 
misassemblies were corrected using gapResolution (C. Han, un- 
published) or Dupfinisher (16), or sequencing cloned bridging 
PCR fragments with subcloning. Gaps between contigs were 
closed by editing in Consed (17-19), by PCR, and by bubble PCR 
(J.-F. Cheng, unpublished) primer walks. A total of 442 additional 



reactions were necessary to close gaps and to raise the quality of 
the finished sequence. The estimated genome size is 8.3 Mb and 
the final assembly is based on 163.6 Mb of 454 draft data, which 
provide an average 19. 7X coverage ofthe genome, and 5,006.8 Mb 
of Illumina draft data, which provide an average 603. 2X coverage 
of the genome. 

The total genome size is 8,278,598 bp with a G+C content of 
72.5%. Prodigal software (20) and the JGI GenePRIMP pipeline 
(21) were used to identify 7,552 candidate protein-encoding 
genes. Annotations using the NCBI nonredundant database, Uni- 
Prot, TIGRFam, Pfam, Priam, KEGG, COG, and InterPro data- 
bases were completed and the results can be accessed at http://img 
.jgi.doe.gov. 

The genome contains numerous genes encoding homologs of 
enzymes that deconstruct plant biomass. COG annotation 
showed that 8.35% of the predicted proteins are involved in car- 
bohydrate transport and metabolism. Genes encoding putative 
lignin-degrading enzymes, such as heme peroxidases, Dyp-type 
peroxidases, and catalases, were identified and are currently being 
analyzed. Pathways for the catabolism of lignin-derived aromatic 
compounds, such as protocatechuate, benzoate, and catechol, 
were also identified. Based on these findings, we anticipate that 
S. virdosporus or its complement of genes for plant catabolism 
could constitute a lignocellulose biorefinery. 

Nucleotide sequence accession number. The genome se- 
quence of Streptomyces viridosporus has been deposited in Gen- 
Bank under accession no. AJFD00000000. 
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